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© Traffic control for ATM networks. 

© An asynchronous transfer mode (ATM) traffic control framework is based on an integrated usage parameter 
control (UPC) approach, which approach provides a unified and scalable solution to the issue of quality-of- 
services (QOS) levels over a range of anticipated services in ATM based networks. The approach is consistent 
with emerging ATM Forum and CCITT standards. Additionally, a UPObased call and burst admission control 
providing the desired QOS over periods of network overload by cai l/burst admission control and traffic shaping 
of source stream preferably uses a dual leaky bucket 
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The present invention relates to an asynchronous transfer mode (ATM) traffic control framework based 
on an integrated usage parameter control (UPC) approach. Specifically, the UPC approach provides a 
unified and scalable solution to the issue of quality-of-service (QOS) levels both over a range of anticipated 
services and during periods of network overload. 

5 Future broadband networks will be required to support new bandwidth intensive applications such as full 
motion video, teleconferencing, interactive multimedia, medical imaging, scientific visualization, distributed 
computing, distributed simulation, client server traffic, etc. These applications will place varying require- 
ments in terms of the quality-of-service they receive from the network. Asynchronous transfer mode (ATM) 
technology is now viewed as the primary switching and transport technology for emerging high-speed 

w networks, both in wide-area and local-area scenarios. While ATM offers the ability to support a range of 
multi-rate services in an integrated manner, technical approaches for maintaining quality-of-service in 
dynamic and heterogeneous traffic environments require further development and standardization. 

In spite of the the promise of abundant bandwidth, ATM may not be able to satisfactorily support many 
QOS sensitive services unless appropriate dynamic traffic control is exercised by the network. The 

75 underlying philosophy of ATM networking is fast generic cell transport with simple switching nodes that 
scale easily with line speed and number of ports. The cell-level control in these nodes is generally quite 
minimal to avoid complex high-speed processing. Application-specific switching node architectures are 
considered to be neither desirable nor necessary. The lack of strong cell level controls at each ATM 
switching node, coupled with the fact that ATM based switches have a limited amount of buffering (due to 

20 technological as well as scaling limitations), can result in traffic hot spots and other congestion related 
problems within the network. Further, the cell-based nature of the transport can result in server performance 
degradation under overload due to cell loss and consequent retransmission of data. 

The applications anticipated in broadband networks have traffic characteristics that are very different 
from those of current applications. Many of these applications require rapid movement of large quantities of 

25 data across the network, which can overwhelm the buffers in the ATM switches, resulting in cell loss. 
Further, the cell-based nature of the transport can result in severe performance degradation under overload 
due to cell loss and consequent retransmission of data. While providing more memory in the network may 
temporarily alleviate the problem, it does not lead to a scalable solution. For example, Fig. 1 shows the 
performance of an ATM multiplexer (without link level controls), that multiplexes traffic from 16 FDDI 

30 routers. Each router has an access rate of 100 M bits/second to the ATM multiplexer, and the FDDI packets 
have an average size of 4 Kbytes. Even at utilizations as low as 60%, and with a buffer size of 200 cells, 
the frame loss rate is of the order of 1.0 x 10"\ which would be unacceptable for most data applications. 
With a larger buffer size of 1000 cells the frame loss rate is of the order of 1.0 x 10~ 3 . Thus additional 
buffers do not mitigate the problem. Controlled movement of data across the network is essential to ensure 

35 high throughput and low loss. 

The present invention concerns a traffic control framework for ATM, that would maintain specified QOS 
levels over a range of anticipated services. Central to this framework is a classification scheme that maps 
applications to specific traffic classes based on their QOS requirements and the statistical characteristics of 
their traffic. Attached to each traffic class is a minimum QOS guarantee. The primary objective of the 

40 proposed traffic control framework is to ensure that the desired QOS is maintained during periods of 
network overload, both short and long-term. The framework is based on a pro-active philosophy of 
controlling sources (via UPObased traffic shaping) to avoid network congestion, while still operating the 
network in the low loss/high utilization region. The UPObased control depends upon a minimum set of 
control functions to handle a wide range of traffic types. Further, the same set of traffic parameters is used 

45 to describe the traffic characteristics of all classes. Specifically, this aspect of the invention uses cell-based 
admission control policy based on the UPC parameters. The framework however is backed by reactive 
controls to protect the network, in the event an occasional short-term traffic overload occurs. The proposed 
traffic control framework can work across LANs, MANs, WANs, private and public networks in a seamless 
manner. The controls are designed to scale with distance, traffic volume and traffic type. 

so A principal feature of this framework is that it depends on a minimum set of control functions to handle 
a wide range of traffic types. Further, the same set of traffic parameters is used to describe the traffic 
characteristics of all classes. 

Another important feature of this framework is that it only uses parameters for control that have already 
been standardized by CCITT and the ATM Forum without needing more statistical information from the 

55 users. 

The present invention concerns only one possible approach, and it will be apparent to those skilled in 
the art that many alternative approaches ranging from no control to per virtual circuit (VC) control are now 
under discussion in standards forums. It is recognized that traffic shaping at each UNI (user network 
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interface) may be considered too expensive in certain LAN or private WAN scenarios, which may choose to 
rely on mechanisms such as link-by-link back-pressure using large buffers or backward error congestion 
notification (BECN) indication. However, this approach (or variations of it) may become necessary to 
address the issue of interfacing private ATM networks to public ATM/B-ISDN, in which traffic shaping and 
5 QOS guarantees are expected to be mandatory. 

Further features of the invention will become more clearly apparent when the forthcoming description is 
read in conjunction with the accompanying drawings. 

Figure 1 is a graphical representation of the upper bounds on cell and frame loss probabilities for FDDI 
inputs; 

70 Figures 2(a) and 2(b) are graphical representations of the input and output signals respectively of dual 
leaky bucket shaper; 

Figure 3 is a schematic representation of a UPC based burst level control in an ATM network; 
Figure 4 is a graphical representation of an output profile of a dual leaky bucket; 
Figure 5(a) is a graphical representation for an output profile of a two state model of a dual leaky bucket; 
75 and 

Figure 5(b) is a graphical representation of an output profile of a modified two state model of a dual 
leaky bucket in accordance with the present invention. 

The ATM network will be able to provide guarantees on the offered quality of service only if it has a 
reasonable description of characteristics of the offered traffic. Hence traffic descriptors that describe the 

20 characteristics of the offered traffic, and policing mechanisms that ensure conformance with the declared 
traffic parameters are essential. Traffic classification simplifies the task of QOS management. It also helps 
to provide various levels of performance guarantees based on the applications need. Once the characteris- 
tics of the source and their required QOS are known, the network can use appropriate admission control 
policies to accept or deny service to new requests wanting to use the services of the network. 

25 In the framework under consideration traffic classification is based on traffic characteristics of the 
offered load and QOS requirements. The framework is based on the assumption that traffic management on 
a per-class basis is more realistic than traffic management on a per connection or per virtual channel basis. 
It is assumed that the service provider will define a set of M classes of service. Each class of service is 
targeted towards a set of specific applications, whose traffic characteristics and QOS requirements from the 

30 network are similar. Associated with each class m(m = 1,2,..., M) \s a traffic parameter vector a m = (a m1 , 
and a QOS vector /3 m = fj9 m1 , $„&,...). The traffic parameter represents the traffic characteristic of 
sources that subscribe to a given service class. Elements of the traffic parameter vector will be quantities 
such as the peak rate, burst size, etc. Preferably, the traffic parameter vector is assumed to be the same as 
the traffic shaping vector defined by the ATM Forum. This is based on the dual-leaky bucket filter. The 

35 QOS vector represents the guarantees that are offered on a particular service. These guarantees are 
statistical, and are offered on a class basis rather than on an individual connection basis. Elements of the 
QOS vector will be network related performance measures such as cell loss, delay, etc. The number of 
such service classes should be sufficiently inclusive to encompass all foreseeable needs of customers, but 
small enough to keep the problem simple and manageable. Associated with each service class offering is a 

40 qualitative description of the service, the range of applications it can handle, the parameters that would 
describe the source characteristics and how to choose them, and the maximum value the parameters can 
assume for the given class (i.e., 

45 

The selection of a particular traffic class for a given application is performed with the help of an 
application programming interface (API). Once a service class is chosen by a source, the network monitors 
the source for the declared traffic parameters (if one has been declared) or monitors the maximum or 

so default parameter values of the chosen service (if no parameters were declared), and takes appropriate 
action on those sources exceeding this constraint. As long as the source parameters are within the 
maximum or default values, the desired quality of service is guaranteed (in fact it could be better but never 
worse). If the user chooses the wrong class, then there may be a degraded level of performance, at which 
point the user would make an alternative choice. The principal advantage of this approach is that the users 

55 are relieved of the burden of precisely characterizing their sources. For the network provider, the result is a 
more manageable problem. Further, policing, scheduling, and resource allocation policies can be tuned for 
each class, and thereby improve network efficiency while maintaining the required QOS. 
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Based on the anticipated traffic characteristics and the performance expected from the network by 
future applications, five generic service classes for ATM are considered. While each of these traffic classes 
is designed to handle a range of traffic parameters, they provide performance guarantees on a statistical 
basis common to all the traffic that subscribe to a given class. This approach makes the traffic more 

5 predictable from the network point of view. Once the traffic becomes predictable (by imposing appropriate 
constraints - in terms of shaping the traffic, and regulating their admission), it is possible to devise 
scheduling mechanisms and resource management strategies that ensure that the desired QOS for all 
service classes are met while network resources are efficiently used. Table 1 lists the five traffic classes 
along with typical applications and their QOS requirements. Class 1 and class 2 are connection oriented 

io services that can support services with real-time requirements, such as voice and video applications. Class 
3 to 5, are intended to support data applications that do not have the same constraints as real time traffic 
(class 1 and 2). Note that the last three classes can support both connection oriented as well as connection- 
less services. 

15 Table 1 



Traffic Class 


Traffic Type 


Application 


QOS Requirements 


Class 1 


CBR 


Voice, video, ... 


Isochronous 


Connection-oreiented (CO) 




low cell loss 
low delay 
low delay jitter 


Class 2 


VBR 


Video, multimedia 


Low delay 


Connection-oriented 




low delay jitter 
moderate cell loss 
statistical Mux 


Class 3 


Statistical data traffic 


interactive data 
client server app. 


low cell loss 
moderate latency 


Connectionless (CL) 


Best effort service 




Class 4 


Low latency 


Distributed computing 
distributed simulation 


low loss 
low latency 


CL 


remote paging 


high throughput 


Class 5 


Bulk data 


File transfers 
Image retrievals 


High throughput 
low loss 


CO/CL 




Moderate latency 



The Control framework is also based on a set of temporal controls that are hierarchically arranged 
based on the time constants of events controlled. These can be classified as: call level controls, burst level 

45 controls, and cell level controls. The controls at the call level and burst level control admission of new calls 
or new bursts. Their control parameters are based on the dual leaky bucket parameters recommended by 
the ATM forum for implementation at the public user network interface (UNI). Their main function is to 
shape traffic and check conformance with the declared UPC parameters in order to ensure the required 
QOS requirements can be met by the network. The controls are based on a fast resource allocation protocol 

50 [FRP] to ensure low latency and high utilization. 

The cell level controls may include: an aggregate back pressure control between switches (on a per 
class basis), a per virtual channel (VC) control based on the FECN [forward error congestion notification] bit 
in the ATM header, dass based scheduling, buffer and bandwidth management policies at each switch, or 
fire walls between traffic classes to minimize interaction. This assumes that the bandwidth is partitioned 

55 dynamically between the different classes of traffic, ft is also possible to use some form of asynchronous 
time sharing of bandwidth at the cell level between the different traffic classes to ensure that the QOS of 
each class is guaranteed and the resource utilization is kept high. The QOS guarantees are provided on a 
class basis rather than on an individual connection basis. 
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The present invention is primarily concerned with the use of the UPC function at the UNI (and the 
complementary shaping function at the CPE) to perform the following functions: 

1. A Call Admission Control (CAC) policy based on resource allocation and a guaranteed QOS for a class 
of traffic. 

5 2. Fast Burst Access Control (BAC) to accept or deny bursts on a burst-by-burst basis and guarantee a 
given QOS for certain classes of traffic. 

3. Allocation and de-allocation of network resources based on the state of the virtual channels (VC) 
connection by using a simple extension to the standard dual-leaky bucket implementation. This results in 
better statistical multiplexing gains in the network. 

w The usage parameter control function (UPC) is implemented at the public network interface to 
guarantee QOS. It is assumed that such functions will also be implemented at the user network interface 
(UNI). Adding this function at the UNI does not significantly increase the complexity of the interface. For 
example, in U.S. Patent Application serial no. 08/097,683 filed July 27,1993 entitled "Control Architecture for 
ATM Networks" and assigned to the same assignee as the present invention, which application is 

75 incorporated herein by reference, there is described an implementation scheme which involves adding one 
extra counter to a standard dual leaky-bucket implementation to do the burst-access control activation and 
de-allocation of resources. In a RAM-based implementation this translates to adding an extra state 
information field to the width of the RAM (8-16 bits) per VC. The UPC function would have to be 
implemented at the UNI in any case when the host machine needs direct B-ISDN connection capability. 

20 The present control strategy, which is based on the UPC function, results in a natural call admission 
control (CAC) as well as a burst admission control (BAC) policy. Further, this UPC based control strategy 
provides a common control framework for both connection oriented traffic such as voice and variable bit 
rate video and data, as well as connectionless traffic. The control can provide statistical guarantees on ceil 
loss, or even guarantee arbitrarily low cell loss via appropriate choice of control parameters. 

25 A dual leaky bucket is defined by three parameters, namely: the peak rate \* the sustained rate X s , and 
the compliant burst size B c - Once a source declares these parameters, the dual leaky bucket implementa- 
tion of the UPC function at the UNI ensures that the source transmits no more than B c cells at the peak rate 
\ P , As shown in Figures 2(a) and 2(b), if the burst size is larger than the compliant burst size B c , then the 
first B c cells are transmitted at the peak rate of Xp. The remaining cells in the burst are only transmitted at 

30 the sustained rate X s which can at most be equal to the peak rate. Hence, once the three parameters of the 
UPC function are known, the switch can determine a worst case traffic profile for the offered traffic. Based 
on this worst case traffic profile, the switch can decide to either accept the call, reject the call, or accept the 
call after suitably modifying the declared UPC parameters. 

Figure 3 is a schematic representation of a UPC based burst level control in an ATM network. For 

35 example, consider a link where N calls/bursts are in progress. Call /(/= 1 ,2, . . . ,N) has UPC parameters Xp, 
\' s and Be. Let L be the number of buffers at the switch port that has a link capacity C. Upon receipt of a 
request for the admission (request to send (RTS) message) of a new call or a burst (from a call that has 
already been established), with UPC parameters 

40 

Ap , 



45 



50 



and 



it is possible to determine the new cell loss probability on a per call basis, if this call is admitted. The 
new call can be accepted if it does not compromise the QOS for calls already in progress. If the new call or 
burst cannot be admitted because the QOS for existing calls will be compromised, it is possible to compute 
55 a new set of UPC parameters XJ, XJ and B* c which is provided by the CTS (cleared to send) message 
signal, that will allow the new call or burst to be be admissible. The subsequently transmitted burst data 
conforms to the new set of UPC parameters Xp, XJ and fl£ 
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and the burst data is admitted. Hence, through the use of the UPC parameters, it is possible to control the 
cell loss by controlled admission of new calls at the call level and new bursts at the burst level. 

Referring to Figure 3, burst mode terminal 10 provides to switch controller 12 a RTS (request to send) 
message containing the parameters peak rate X* sustained rate X s , and compliant burst size B c of the burst 

5 data packet to be sent. Switch controller 12 determines whether the network (not shown) can handle the 
data. If the network is able to handle the data, the burst data is connected to the network via ATM switch 
14. If the network is unable to handle the data with the proposed parameters, the switch controller sends a 
CTS (cleared to send) signal to the burst mode terminal including revised parameters x;, \* s and ££ which if 
the burst data packet conformed to these revised parameters, the data would be admitted to the network via 

w ATM switch 14. Subsequent burst data is transmitted from burst mode terminal 10 conforming to the 
revised parameters. 

The switch controller 12 as described below considers the class of traffic, the guaranteed QOS, and the 
existing traffic of the network in determining the revised parameters. Alternatively, the burst mode terminal 
can send burst data, without an RTS message, in which case the switch controller must determine whether 
75 the burst data can be admitted or whether the parameters must be changed. 

In a variation of the above described negotiation method of changing the parameter values of the burst 
data, it is possible for the burst mode terminal to select new parameter values, without receiving revised 
parameters from the switch controller, when the data is not admitted by the switch controller. Such an 
arrangement, while possible, is undesireable since unlike the situation when the revised parameters are 
20 provided, there is no certainty that the new parameters selected by the burst mode terminal will result in the 
data being admitted, thereby further delaying admission of the data to the network. 

The "neogtiation" where the switch controller provides revised parameter values which if used by the 
burst mode terminal will assure admission of the data to the network, provides the most timely method of 
adjusting the burst data parameters. 
25 As mentioned above, the control can be implemented at the call level as well as at the burst level 
based, on the traffic class. If the control is implemented at the burst level also, then resources are 
deallocated if the source or VC becomes inactive. When a source becomes active again, the resources are 
restored through a fast reservation protocol (FRP). 

The proposed framework uses the same UPC parameters for all the five traffic classes. However, by 
30 proper choice of control parameters it is possible to either enable or disable the controls. This makes the 
implementation simple since it is necessary to be concerned with only a single qualitative control 
mechanism for all traffic classes. Associated with this control is a timeout function with timeout parameter 
T 0t which is used to detect inactivity of sources or VCs so that resources allocated to these VCs can be 
deallocated in the event the VC becomes idle. 
35 1 . Class 1 Traffic: Isochronous or CBR traffic. 

This class consists of isochronous traffic whose QOS requirements include low delay, low delay jitter and 
low cell toss. For this traffic class, we set X P = X s and flfc = 1 and the timeout parameter T 0 = a> - That 
is, a class 1 VC is treated as generating one long burst, with a deterministic cell arrival rate of \p. Thus, 
the burst level control is never invoked. In this case the call admission control admits calls based on its 
40 peak rate, and the bandwidth available to this traffic class. However, the same resource allocation 
scheme as the one used at the burst level is used here. 
2. Class 2 Traffic: Stream type VBR traffic. 

This class has the QOS requirements of low delay, low delay jitter but can tolerate moderate cell loss. 
Class 2 traffic is generally bursty nature and hence is amenable to a certain degree of statistical 

45 multiplexing. There are a few possible approaches to mapping the QOS requirements of this class to a 
set of UPC parameters. One approach is to set parameters as for the CBR source (X P = X s ), but with 
appropriate statistical multiplexing rules at the call admission control level. A second option is to use ail 
three parameters corresponding to expected application peak rate, sustained rate and burst duration, and 
then adaptively change these parameters every To seconds. Such changes may be activated either by 

so the terminal or by the switch, in response to local and global conditions respectively. Considering for 
example VBR MPEG video, the first mode of operation simply involves selection of peak bit-rate per 
picture or group-of-pictures (GOP), depending on codec buffering capabilities. In the second mode, 
higher statistical multiplexing efficiency could be achieved by setting the peak rate to that expected for I- 
frames, sustained rate equal to that of a GOP, and T 0 equal to a GOP interval. In this case, the switch 

55 controller could monitor traffic from several such VBR sources and adaptively set parameters from GOP 
to GOP, taking into account occasional global congestion due to non-stationary behavior. This mecha- 
nism could also support burst-gaping for multiple video sources, thus providing a smoother traffic flow 
into the network. 
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3. Class 3 Traffic: Statistical data traffic. 

This class has QOS requirements of low-to-moderate cell loss, low latency, but with no stringent delay 
constraints. Such traffic streams are bursty with a low peak rate to link speed ratio, and hence are very 
amenable to statistical multiplexing. Conventional interactive/transaction data is included in this traffic 

5 class. The traffic class can also be used to provide, either connection oriented or connection-less best 
effort service, with the difference that a certain bound on the maximum cell loss can be guaranteed. This 
traffic class attempts to multiplex VCs on a statistical basis, with moderate risk of cell loss, but provides 
high utilization and low latency. In order to realize reasonable gain in statistical multiplexing, it is 
necessary to ensure that a large number of streams are being multiplexed, with the individual source 

10 intensities being a small fraction of the total capacity allocated to this class. To ensure that the conditions 
for statistical multiplexing are met, it is assumed that this class has a default set of UPC parameters, that 
are chosen to ensure the success of statistical multiplexing. When a VC chooses this class, no 
negotiation need be conducted between the source and the switch. If the offered traffic fails to conform 
to the default UPC parameters, the traffic will be automatically shaped at the user interface, thereby 

rs ensuring conformance. Nonconforming cells are tagged using the CLP bit in the ATM header. The switch 
can monitor the offered traffic in this class, and decide if new connections should be accepted or 
rejected. If the statistical gain reduces, the switch can allocate more bandwidth to this class if available. If 
queue overflow occurs, the tagged cells can be dropped first to relieve congestion. In addition a back- 
pressure mechanism is used to slow class 3 traffic from immediate upstream nodes on a link by link 

20 basis. At the same time the sources are informed to reduce their rate, either through the FECN bit in the 
ATM header, or through a backward congestion notification cell (BECN). The back-pressure control is not 
intended to propagate its effect all the way back to the source, but to only cope with congestion locally, 
until the FECN or BECN signals become effective. 

4. Class 4 traffic; Low latency - low loss traffic. 

25 This class has the QOS requirements of low latency, low cell loss and moderate to high throughput. 
Such traffic streams are also bursty but with a moderate peak to link speed ratio. Applications such as 
distributed computing and simulation, are expected to fall into this traffic class. The traffic characteristics 
and QOS requirements of this traffic class makes it difficult to achieve good statistical gain and at the 
same time ensure low loss. To circumvent this problem, statistical multiplexing with low loss to guarantee 

30 low cell loss is used. Since this can lead to low utilization of the link (i.e., a low statistical gain), resources 
are deallocated when class 4 VCs are idle. When idle VCs become active, a fast resource reservation 
protocol is used to restore the resources. This ensures that the link utilization is high while maintaining 
the cell loss and latency low. 

When a new class 4 VC originates, the VC declares its UPC parameters. The switch accepts the call 

35 with modification to the UPC parameters if necessary. During the course of a session, if the VC becomes 
idle for a time To* the resources are deallocated. The deallocated resources can now be allocated to 
other VCs. When the VC becomes active, that is, it has a new burst to transmit, it transmits the burst 
using the old UPC parameters, that it negotiated before. The switch detects the new activity, and 
determines if the old UPC parameters that the newly active VC is using can be sustained. If the old UPC 

40 parameters cannot be sustained, the switch instructs the VC to use a new set of UPC parameters. The 
VC can either use this new set of UPC parameters, or back-off and try later. Note that the burst level 
control is implicit and the control attempts to minimize the delay in admitting new burst from inactive 
VCs. In fact, the burst admission delay is a crucial QOS parameter for this traffic class. Further, since the 
individual burst sizes in this class are not likely to be large, the new UPC parameters will not effect the 

45 current burst, but will only be applied to succeeding bursts. One can view this class as one with a 
dynamic UPC renegotiation capability. Because of the fast reservation protocol, low latency, low loss and 
moderate throughput can be guaranteed to this class. In fact, it is possible to transport motion video 
frames using a burst mode of transmission in this class. This is of special interest in providing services 
such as video on demand. 

so A necessary requirement to the providing a guaranteed grade of service to this class is the notion of 

some form of fair queueing or Asynchronous Time Sharing (ATS) of bandwidth at the cell level to each of 
this traffic classes. ATS involves breaking the link bandwidth into chunks and a service discipline that 
ensures that on an average the bandwidth requirement is guaranteed for each class. For example if given 
a frame of N cells, the server ensures that each traffic class will be served for at least N f cells in a given 

55 frame or less (if class i contains less than N, cells in a given frame) in a work-conserving manner, where 
the sum of N f is equal to N. When a new class 4 VC originates, the VC declares its UPC parameters. The 
switch accepts the call and recomputes the ATS bandwidth allocated to this type at the cell level. The 
delay for this class is minimized by ensuring that enough bandwidth is allocated to this particular class 
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so that the probability of a burst being admitted is high and then using fair queueing techniques to give 
the unused bandwidth to other classes when this class is inactive. 
5. Class 5 Traffic: Bulk data transfers. 

This class has the QOS requirements of high throughput and very low cell loss but can tolerate moderate 

5 to high end-to-end delays. This class of transport is especially applicable, when the burst sizes are so 
large that the actual transmission time would be comparable to, or larger than, the admission delay. Note 
that if a sources has several large bursts to transmit it can pipeline its requests. While this class uses the 
FRP protocol that has been proposed in the past, it differs in two important aspects. Firstly, it is 
applicable to only class 5 traffic. Secondly, the present invention makes use of the three UPC 

70 parameters. This traffic class will be used by applications such as file transfers and image retrievals, that 
require a large amount of data to be transferred. Such applications require a high throughput with low 
loss, but can tolerate some delay. Because, bursts can be very large, they can place a large load on the 
network for significant periods of time, leading to large cell loss. Hence, the philosophy here would be 
that rt is better to block the burst at the beginning, than to accept the burst and jeopardize bursts that are 

75 already in progress. This traffic class uses an explicit burst level control, where the UPC parameters are 
negotiated for each n ew burst, using the fast reservation protocol. At the end of the burst the VC is 
quickly timed out, before it can begin a new burst. When the VC requires transmission of another burst, 
it has to make a new request. Transmission of the new burst will commence only after the necessary 
resources have been allocated. If the burst size exceeds the negotiated compliant burst size B c , the 

20 remaining burst is transmitted at the sustained rate X s . The VC however, can make a request, to be able 
to transmit the remaining burst at the peak rate Xp. This class introduces admission delay for each new 
burst. However, once a VC commences transmission, it receives a high throughput delivery at very low 
cell loss. 

Having described a traffic control framework designed to maintain specified QOS levels over a range of 
25 anticipated services, now a traffic control framework will be described which ensures that the specified QOS 
levels are maintained during periods of network overload, both short and long-term. 

At the time of call set up, each VC declares the UPC parameters of its associated source, based on a 
limited knowledge of its traffic characteristics. Assume that the source expects its packet size to be of 
length B c . For example, in most transport protocols, the maximum length of the transport packet data unit 
30 (PDU) is fixed. In such cases one choice would be to set B c equal to the maximum PDU size. Further, 
assume that the transmitter pauses for a time Tpo Usa before the next packet can be transmitted. Let the 
source also request for a peak transmission rate X P . In this instance the sustained rate X s can be chosen 
such that the sustained rate counter attains a value of zero before the next burst commences. That is, 



35 



45 



-^(Aj> — As) = X$Tp a 



40 giving 



Be + XpTpa 



The compliant burst size is set equal to B c . Note that the required sustained rate decreases as the pause 
Tpouso increases. If the sustained rate is chosen too small, the sustained rate counter will not count down to 
zero before the next packet is ready for transmission. This will result in an increase in transmission time for 
the next packet. Choosing a larger value for the sustained rate will not impact the performance of the 
so source, but will result in a lower statistical multiplexing gain, which would reflect in higher transmission 
costs. If an average rate is defined as 



55 " Tp + 7>, 

then, Xa - x s . 
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To provide guarantees of QOS levels, the network must have some knowledge about the characteristics 
of the offered traffic. The UPC parameters are intended to provide this characterization to the network. The 
dual leaky bucket at the source shapes the source traffic based on the negotiated UPC parameters, while 
the policer (or switch controller) at the switch checks for conformance with the declared UPC parameters. 

5 While this shaping process makes the source traffic more predictable, the three UPC parameters do not 
fully capture the characteristics of the source traffic, and hence the output process of the shaper is not 
completely deterministic. In particular, it can only be certain that the source will transmit no more than B c 
cells at the peak rate \p, The remaining two states, namely the time spent in the sustained mode (and thus 
transmitting at the rate X s ), or the time spent in the pause or idle mode (and thus not transmitting), are 

w indeterminate. The time in the pause mode can however be bounded by using a timeout of length To- If the 
pause period exceeds a time T& the corresponding VC or source can be declared idle and the resources 
can be deallocated. 

For statistical multiplexing of bursty sources a model is needed to characterize the source traffic based 
on the declared UPC parameters. The simplest model is a two state model as shown graphically in Figure 

is 5(a), where the source transmits at the peak rate \ P (peak rate mode) for a time T P = B + C/Xp, and 
pauses for a time T s = T min = Tp(k P - X s )/X s (during which time no cells are transmitted), such that the 
average rate \ A over a period T P + T s = \ s . A limitation of this model is that, while the model assumes that 
the source does not transmit during the pause period, it could in fact be transmitting at the sustained rate \ 5 
for an unspecified length of time. Any call or burst admission control policy based on this model can thus 

20 be optimistic and admit more calls or bursts than can be sustained at the required QOS. In order to 
overcome this limitation, a modified two state model such as shown graphically in Figure 5(b) is used, 
where it is assumed that the source can be in one of two modes or states, a peak rate mode or a sustained 
rate mode. In the peak rate mode, which lasts for a time T P = B<Ap. the source transmits B c cells (equal to 
the compliant burst size) at the peak rate X/* This is followed by a transition to a sustained rate mode of 

25 mean length T s = T min = Tp(k P - X s yx s , during which time the source transmits at a rate 0X$(O < jS < 1 ). 
Note that if /S = 0 the result is the original two state model. The modified two state model results in a 
conservative call-burst admission policy for the following reasons: 
Consider the case when 0 = 1: 

1 . the maximum time spent in the peak rate mode is T P = Bcf\ P . This provides an upper bound on the 
30 number of cells that can be transmitted at the peak rate. 

2. After completing the transmission of 0c cells at rate Xp, the actual source can switch to one of three 
states. The source 

(a) Case 1: remains in a pause or idle mode for an indeterminate period of time, during which time it 
does not transmit any cells; 

35 (b) Case 2: transmits at a rate less than or equal to the sustained rate X s for an indeterminate period 

of time; 

(c) Case 3: transmits at some rate less than or equal to the sustained rate X s for some time greater 
than T s assumed by the model, and then switches to an idle mode. 
The modified two state model however assumes that, following the transition from the peak rate mode, 

40 the source remains in the sustained rate mode for a time T s = Tp(k P - X s )/X s transmitting at the sustained 
rate X s , and then switches back to the peak rate mode. If case 1 occurs, then the source is timed out after a 
time T 0 and the resources are deallocated. On the other hand, if the actual source pauses or idles for a 
time /pat*© less than the time out interval Tq, but greater than the time spent in the sustained period T s 
assumed by the model, the impact of the model on the call admission policy will be slightly conservative. 

45 That is, rf it were known that the source actually switched to an idle mode in between the peak rate modes, 
more calls could have been admitted. If the pause or idle period Tp^u*, is shorter than T s , then the 
compliant burst size for the next packet is reduced. That is, fewer cells can now be transmitted at the peak 
rate Xp, Since the model assumes that the compliant burst size is B c , once again the impact of the model 
on the call admission policy is conservative. 

so If case 2 or case 3 occurs, then, in a time T P + 7" s fewer cells are actually transmitted by the source 
than what the model assumes, resulting in a call admission policy that is more conservative. While a 
conservative call admission policy ensure that the desired QOS is not compromised, it results in a reduced 
utilization of the link bandwidth as well. 

Thus, while the UPC parameters cannot precisely characterize the behavior of the sources, it is possible 

55 to define a traffic profile that will result in a call admission policy that is more likely to be conservative (than 
optimistic), but will ensure the QOS is not compromised. The call admission policy can be made more 
optimistic by reducing the value of 0. but at the risk of admitting more calls than is supportable at a given 
QOS. 
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Each switch in the ATM network is modeled as single stage multiplexer with L buffers, and a link 
bandwidth C. Each admitted call / has UPC parameters X P , \ s , B l c . New calls are admitted only if the 
resulting probability of cell loss Pee// will not exceed the given value c. The number of buffers L is assumed 
to be small enough, so that they do not introduce any significant queueing delay. 

5 "The call-burst admission control policy is implemented at the call level as well as at the burst level 
respectively. Assume that there are a maximum J sources, where each source / has UPC parameters X Pf 
\' 3 ,\' c . The call admission policy is based on the modified two state model. Each source or VC is assumed 
to be in one of two states. It can be in an ACTIVE state, or it can be an IDLE state. While in the active state, 
the VC is either in a peak rate mode or a sustained rate mode. In the peak rate mode the VC transmits B c 

10 cells equal to its declared compliant burst size at the peak rate Xp, which lasts for a time Tp. In the 
sustained rate mode the VC is assumed to transmit at a rate £ \$(0 < p S 1). The sustained rate mode lasts 
for a time T s = Trfkp - X s yx s . If the VC pauses (i.e., the VC does not transmit any cells) for a time greater 
than 7*0, the source or VC is declared as being IDLE. All admitted calls require that the cell loss P c&i be 
less than e. Assume that /calls are in progress. Let a request to admit a new call, or a new burst (from a VC 

ts that has been timed out) arrive. Based on the declared UPC parameters of the new call or burst, as well as 
those calls already in progress, the call-burst admission control admits (or rejects) the new call or burst, if 
the resulting cell loss probability P C on is less (greater) than some desired value e. Once the call is admitted, 
if the source or VC associated with the call is idle or inactive for a time greater than 7o,the VC is timed out 
(and declared IDLE) and the resources are deallocated. When the timed out or IDLE VC has a new packet 

20 to send, it is subjected to a burst admission control (BAC). The burst admission control policy either 
restores the resources and admits the burst or blocks the burst, based on the number of VC that are 
currently active and their respective UPC parameters. The new burst from a timed out VC is accepted only 
if the guaranteed cell loss rate Pceii £ * can be maintained. Once the burst admission control accepts a 
packet from a VC that has been declared to be IDLE, succeeding packets from the same VC are admitted 

25 without being subject to the burst level control, unless the VC is timed out again. A VC that is blocked can 
retry after an exponential back-off of length Tbo^ 

Assume that source /(/' = 1,2 J) has UPC parameters X Pf Xi, B' c . The source is modeled as a binary 

source. The source spends a time V P in the peak rate mode, when it transmits B' c cells at the peak rate of 
Xp. Each peak rate mode is followed by a sustained rate mode, which lasts for a time V 3t when it is 

30 assumed to be transmitting at a rate fi\ 8 . 

Thus the time spent in the peak rate mode is given by: 



&c 

35 x 



40 



50 



The time spent in the sustained rate mode is given by: 



3i = ^ (A t" 4) - (2) 



45 Thus the probability P i source / is in the peak rate mode is 



pi ~T i P lT i s ® 



The probability source i is in the sustained rate mode is given by: 



55 O-t" _ ! n\ _ Ts 
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The mean rate of transmission ff of source / is 



20 



30 



35 



40 



IV = P i \)> + p- i X s . (5) 



A call-burst admission control policy that uses the UPC parameters, is simple to implement can handle 
heterogeneous sources, and the estimated cell loss probabilities are close to the observed cell loss 
70 probabilities. The call-burst admission policy is based on a heuristic that estimates the cell loss probability 
with reasonable accuracy. 

If the number of buffers is L and the link capacity is C, the time to empty the L buffers T E = L/C. 
Assume that time is slotted and each time slot is of length r & The number of cells n' P that VC 
will generate during a time slot while in the peak rate mode is T^ P . The number of cells n l 3 that VC i will 
75 generate during a time slot while in the sustained rate mode is T^ 3 . Let nj- be a random variable that 
represents the number of cells VC / will generate during a time slot of length 7V Further, let 



^(j") = E4, (6) 



where NrfA representes the total number of cells that arrive in a time slot of length T E given / VC have been 
admitted. Note that the n' T are random variables. Assume that these Nj{j) cells arrive as a batch at the 
25 beginning of each time slot. Since the multiplexer can serve L cells in every time slot of length T E , if the 
number of cells Njij) that arrive in any time slot is £ L no cells will be lost and the queue will be empty at 
the beginning of each slot. If tMj) > L, then - L will be lost on arrival. Now /Vj(/) is the sum of / random 
variables. That is, if /(/ = 1 ,2,...,«/) call are in progress 



where, 



n)> = T B \)> with probability/* (7) 
= T E y s with probability^- (8) 



and 



45 f*r = n p + n s • 

£From (7 and 8) it is possible to determine the probability distribution function 

so P[tW = m) = P(roj/), 

were P(mj/)is the probability that m cells arrive at the beginning of a time slot, given j calls have been 
admitted. Using a generating function approach, we have 

55 



11 



EP 0 658 999 A2 



10 



50 



55 



/*rto = II( p, " ^n5, + P " ,^n *) 



t=l 



= Co + C\z + C2Z 2 + • • • 

where, P[Nj{j) = m] = Cm. The coefficients are evaluated when 7 calls are in progress. Define cell loss 
probability when j calls are admitted as 



*=oo 

£(*-!)+ J>(*|j) 

Pcett = ^ . (9) 

where, />(*!/) = C k . The numerator gives the average number of cells lost in a time slot. The denominator 
gives the average number of cells that arrive in a time slot. When a new call or burst arrives, the call 
20 admission policy computes the value of Pcea (given by equation 9) as a result of admitting this new call. If 
Pcofh thus computed is less than the target value c, the call or burst is admitted. Otherwise, the call or burst 
is rejected. 

The admission policy based on equation (9), is very conservative, since it assumes that the cells arrive 
according to a batch process. In reality, the arrivals would be spread over the time slot. With the result, the 

25 cell loss probability Pcea estimated by equation (9) will be much higher than the actual cell loss probability. 
This results in the admission of fewer calls or bursts, leading to reduced utilization and large admission 
delays. It is possible however to use equation (9) as a lower bound for accepting calls. 

The call admission policy will be improved as follows. Assume a fluid model where the cells arrive like 
a fluid over the entire length of every time slot (rather than as a batch at the beginning of each time slot). In 

30 this case, if the number of cells NtQ) that arrive over the time slot of length T £ is less than L, not only are no 
cells lost, but no queueing results. On the other hand, if more than 21 cells arrive during a time slot, L cells 
are served, L cells are queued, and the remaining tMJ) - 2L cells are lost. This model however, requires 
tracking of the evolution of the queue to determine the cell loss probability. Since the computation of Pceii 
should be simple so that the cell loss can be computed in real time, the following heuristics are used. 

35 Assume that the multiplexer queue will be in one of two states. The queue is in a state of under-load (during 
which time the queue is empty for most of the time) or it is in a state of overload (during which time the 
queue is full and cells are being tost). Note that if it is assumed that the cell arrival process is a fluid model, 
the queues will be empty as long as the instantaneous arrival rate is less than the link capacity C. Also 
assume that the transitions from an under-load state to an overload state, and vice versa occurs in a very 

40 short time. This would be true if the link speeds are high, and the sources are bursty with high access or 
peak rates. A transition can occur from an empty queue to a full queue (leading to loss) only if the 
combined instantaneous arrival rate is greater than twice the link capacity. That is, cell loss occurs only if 
more than 2L cells arrive in a time slot. Assume that the performance of such a system (with link capacity C 
and L buffers) can be approximated by an equivalent slotted system with link capacity 2C and with 21 

45 buffers, where each time slot T E = 2L/2C - L/C, and the cells arrive as a batch at the beginning of each 
time slot. In such a system, the queue is always empty at the beginning of each time slot. If the number 
cells NjiJ) that arrive at the beginning of a time slot is greater than 21, then Mrf/) - 2L cells are lost. 
Approximate the cell loss in such a system as, 



Jfe=oo 

E(*-2Z,)+P((*L0 
= (10) 

£ kP(k\j) 



JTc 



fc=0 



The cell loss predicted by equation (10) is optimistic, and the call admission policy can admit more 
calls than can be sustained at the required QOS or cell loss. To correct this fact a second constraint is 
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introduced. From Equation (5) the mean rate of transmission of source / is 

5 

The second constraint requires that, after admitting the new call or burst, the combined mean arrival rate 
must be less than the link capacity. That is, 

10 

R = J2&<<*C (0<a<l). (11) 

75 

Thus, given / calls have been admitted, when the {/ + 1)f/? call arrives (or when the (/" + 1)th VC makes a 
transition from an idle to an active state), the cell loss probability Pest) given by equation (10) and the 
combined mean arrival rate R from equation (1 1) are computed. The call or burst is admitted only rf 

20 Pceti < e and R < aC. 

The advantage of this call admission policy is that it is simple, and it can applied to heterogenous traffic 
sources. The coefficients C m = P(m)fi are obtainable in an incremental fashion, each time a new call (or 
burst) is admitted or released (or timed out). Assume that j calls are in progress and the coefficients C m (m 
25 = 0,1,2,...) are known. If the {/ + 1)f/) call or burst is admitted, with model parameters 

pj+! 

30 then the new coefficients are given by: 

35 

By rewriting equation (10), only coefficients from Co to C^. need be determined. 

A comparison of the performance of a UPC based call acceptance policy with a policy that uses only 
peak rate allocation under two different source work-loads was performed. The first workload represents a 
typical distributed computing application while the second workload represents an image retrieval applica- 
40 tion. The performance was measured of a multiplexer running at C = 150 Mbits/sec., with L = 500 buffers. 
Table 2 lists the source workload characteristics. 
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Table 2 



Parameter 


workload i 


wonxioaa d 


Number of sources 


50 




Peak Rate x c 


10, 25, 45 MDltS/SeC 


10, 25, 45 MD ItS/SeC 


ou stained Kate \s 


/.O, 1U, ID MDits/sec 


/ .5, 1 5, md us/sec 


Mean packet size B c 


10 Kbytes 


64 Kbytes 


Mean number of packets per burst 


8 


16 


Mean Interpacket gap Exponential 


3 msec. 


3 msec. 


Mean Interburst gap Exponential 


30 msec. 


3 sec. 


Time out interval 


10 msec. 


10 msec. 


Desired cell loss probability P C eti 


<1E-6 


<1E-6 



The network is modeled as a multiplexer, with a link capacity of 150 Mbits/second, and 500 cell buffers. 
Further, the call admission policy based on equation (10-11) admits the first packet after a timeout only if it 
can ensure that the cell loss probability will be less than 1.0 x 1CT 6 . For the examples shown, £ = 0.5 and 
a = 0.9. For the peak rate allocation scheme, a new packet (after a timeout) is admitted only if the sum of 
the peak rates of all admitted source is less than 150 Mbits/second. Note that in the peak rate allocation 
scheme, the sustained rate is equal to the peak rate. The results for the first workload when the packet size 
is fixed at 10Kbytes are presented in Table 3. The results for the second workload with a fixed packet size 
of 64Kbytes are presented in Table 4. In each case, the first line represents the UPC control scheme and 
the second line represents the peak rate allocation scheme. As shown, the UPC based scheme results in a 
higher link utilization than the peak rate control. 

Table 3 



Peak Rate Mbs 


Sustained 


Cell Loss 


Link Utilization 


Mean 


Mean Admission 


Number of 




Rate Mbs 






Transmission 


Delay (msec) 


Admitted 










Time (msec) 




Sources 


10 


7.5 


<IE-6 


89% 


8.8 


86 


21 


10 


10 


0 


63% 


8.0 


112 


14 


25 


10 


<IE-6 


81% 


6.4 


136 


13 


25 


25 


0 


38% 


3.2 


147 


5 


45 


15 


<IE-6 


53% 


3.9 


270 


6 


45 


45 


0 


28% 


1.8 


162 


3 
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Table 4 



Peak Rate Mbs 


Sustained 


Cell Loss 


Link Utilization 


Mean 


Mean Admission 


Number of 




Rate Mbs 






Transmission 


Delay (msec) 


Admitted 










Time (msec) 




Sources 


10 


7.5 


<IE-6 


68% 


66.1 


27 


18 


10 


10 


0 


41% 


51.2 


28 


14 


25 


10 


<IE-6 


76% 


32.1 


152 


8 


25 


25 


0 


46% 


20.5 


52 




45 


15 


<IE-6 


64% 


18.7 


252 


4 


45 


45 


0 


52% 


11.6 


92 


3 



It has been observed that the UPC based control can achieve significant statistical gains and still ensure 
the promised QOS. While the UPC based control results in higher link utilization in all cases, it also leads to 
longer waiting times when both the packet size and the peak rates are large. Hence, when the packet size 
is large and the required peak rate or thoughput is high, a peak rate allocation scheme on a burst by burst 
basis may be more appropriate. In the traffic control framework described above, applications such as 
image transfers (which have large bursts and require a high throughput), will use class 5 service, which 
provides control on a burst by burst basis. In all other cases, the UPC based control will deliver the 
promised QOS at high link utilization. 

While there has been described and illustrated a new UPC-based traffic control framework for ATM 
network where the control framework is based on pro viding guarantees of the offered QOS for each of five 
aggregated traffic classes, by means of call/burst admission control and traffic shaping of source streams 
using a dual leaky bucket using a single UPC mechanism to address the needs of a wide range of new 
applications in an integrated manner, and a new UPC-based call and burst admission control providing the 
desired QOS by means of a call/burst admission control and traffic shaping of source stream using a dual 
leaky bucket it will be apparent to those skilled in the art that variations and modifications are possible 
without deviating from the broad principle and spirit of the present invention which shall be limited solely by 
the scope of the claims appended hereto. 

Claims 

1. An asynchronous transfer mode (ATM) traffic control method comprising the steps of: 

defining a transmitted burst packet in terms of parameters peak rate (X P ), sustained rate (\$), and 
compliant burst size (B c y, and 

controlling admission of the transmitted burst packet whereby when the packet is not admitted, the 
parameters are changed so that the packet is admitted. 

2. An ATM traffic control method as set forth in claim 1 wherein the parameters are changed responsive 
to parameters from said controlling said admission. 

3. An ATM control method as set forth in claim 2, where said parameters are changed based upon burst 
allocation. 

4. A method as set forth in claim 1 , 2 or 3, where said defining a transmit burst packet includes shaping 
the packet. 

5. An asynchronous transfer mode (ATM) traffic controller comprising: 

burst mode terminal for transmitting burst data having parameters of peak rate (X P ), sustained rate 
(X s ), and compliant burst size (B c ); 

switch control means for receiving said burst data and determining whether to admit said data to a 
network; 

if said switch control means does not admit said data, said burst mode terminal transmits burst 
data having revised parameters of peak rate (X*), sustained rate (X*), and compliant burst size (B* c ). 
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6. An ATM traffic controller as set forth in claim 5, where said switch control means transmits revised 
parameters to said burst mode terminal for causing said burst mode terminal to transmit burst data 
having the revised parameters. 

7. An ATM traffic controller as set forth in claim 6, further comprising ATM switch means for coupling said 
burst data to the network when said switch control means admits said data. 

8. An ATM traffic controller as set forth in claim 7, further comprising a network to which burst data is 
admitted according to the class of said data in order to maintain a desired quality of service. 

9. A controller as set forth in any one of claims 5 to 8, further comprising ATM switch means for coupling 
said burst data to the network when said switch control admits said data. 

10. An ATM traffic controller as set forth in claim 9, further comprising a network to which burst data is 
admitted according to the class of said data in order to maintain a desired quality of service. 

11. A method of admitting calls to a network including L buffers and having a link capacity C comprising 
the steps of: 

receiving data; 

admitting the received data to the network when 



where P(Ac|/) is the probability that k cells arrive during a time slot given j cells have been admitted, 
* is a target value, R is the mean rate of transmission, and 0 < a S 1 . 



oo 




£ kP{k\j) 
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